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After the discovery of a light and likely fundamental Higgs boson during the LHC Run I [1, 2], the 
test of the Standard Model nature of this Higgs boson will be one of the key goals of the upcoming 
LHC run(s). One of the most interesting parameters of the Standard Model (SM) is the top Yukawa 
coupling y t . One reason is that, because of its large size, it dominates the renormalization group 
evolution of the Higgs potential to higher, more fundamental energy scales [3]. On the other hand, 
this coupling is one of the hardest to directly determine at colliders [4, 5], because this requires a precise 
measurement of the ttH production cross section. This cross section can in principle be measured at 
hadron colliders [6, 7, 8] as well as at e + e _ colliders [9, 10]. However, a suitable e + e _ collider should 
at least have an energy of 500 GeV. If a future e + e^ Higgs factory should have lower energy, the 
precise measurement of yt will have to be postponed to a future hadron collider, such as the 100 TeV 
pp collider under consideration at CERN [11] and in China [12]. 

The global set of physics opportunities of such a 100 TeV collider is being explored in many 
studies. Obvious pillars of the physics program will include the study of weakly interacting thermal 
dark matter [14], the gauge sector at high energies [15], the complete understanding of the nature of the 
electroweak phase transition [16], and shedding more light on the hierarchy problem. The picture will 
rapidly evolve in the near future, also in view of the forthcoming results for the search of new physics 
at the LHC, in the experiments dedicated to the study of flavor and CP violating phenomena, and 
at the astro/cosmo frontier. Nevertheless, the continued study of Higgs properties, pushing further 
the precision of LHC measurements, exploring rare and forbidden decays, and unveiling the whole 
structure of the electroweak symmetry-breaking sector [17], will provide the underlying framework for 
the whole program. 

These goals and benchmarks are, already today, clearly defined, allowing us to start assessing 
their feasibility. For example, first studies indicate that a SM Higgs self-coupling could be measured 
at 100 TeV with a precision of 5-10% [18], for an integrated luminosity of 30 ab^ 1 , consistent with 
the current expectations [19]. Similar 100 TeV studies, for the Higgs couplings that are already under 
investigation at the LHC, are still missing. The fact that already at the high-luminosity LHC (HL- 
LHC) the couplings’ extraction will be dominated by systematic and theoretical uncertainties [20], 
makes it hard to produce today reliable predictions. One important exception, where statistics may 
still be limited at the HL-LHC, is ttH production. This measurement is also a key ingredient for the 
determination of the Higgs self-coupling. 

In this paper we will show that a precision measurement of the top Yukawa coupling y t should 
be added to the main physics opportunities of a 100 TeV hadron collider. The crucial distinction 
between this measurement at 100 TeV w.r.t. LHC energies is the potential to fully exploit the features 
of boosted objects and jet substructure [21], thanks to a large-statistics sample of highly boosted top 
and Higgs particles, as shown in Fig. 1. Our analysis will be based on the first HEPTopTagger 




Figure 1: Integrated transverse momentum distributions for the Higgs boson and top (anti-top) quark, 
in the ttH process at a 100 TeV collider (left) and the 13 TeV LHC (right). 
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application to tiH production with a Higgs decay to bottoms [6]. There are three differences between 
the original LHC analysis [6] and this 100 TeV analysis: 

First, the statistically limited LHC analysis of boosted tiH production will benefit from the hugely 
increased statistics with a 100 TeV collider energy and an integrated luminosity of few tens of ab _1 . 
For example, Fig. 1 shows that requesting pt,h > 500 GeV gives a rate of 0(1) pb, or 10M events 
with 10 ab -1 . This improved statistics also allows us to rely on a well-measured and similarly peaked 
tiZ —> ti bb signal to reduce systematic and theoretical uncertainties. In particular, we will show in 
Section 2 that the cross section ratio a(tiH)/a(tiZ) is subject to very small theoretical uncertainties, 
which, already today, are in the range of a percent. This precision will certainly improve with future 
calculations. 

Second, in Fig. 1 we see that the typical transverse momentum spectra of all particles are 
significantly harder, giving us a larger relative fraction of events with px,t > rrit and pt,h > tuh- 
The corresponding results with default taggers will be shown in Section 3. 

Finally, the recent improvement in the HEPTopTagger2 [22] and in the BDRS Higgs tagger [23] 
will allow us to avoid background sculpting and to increase the signal statistics. This last set of 
improvements will be applied in Section 4. We will find that the ratio between fiducial cross sections 
for the tiH and tiZ processes can be measured with a percent-level statistical precision. Assuming 
negligible beyond-the-SM contamination in the tiZ production process, and in view of the theoretical 
systematics discussed in Section 2, this gives a measurement of the product of yt times the H —► bb 
branching ratio, B(H —>• bb), to 1%. If the 100 TeV pp collider will be preceded by an e + e _ collider, 
B(H —>• bb) may be known to better than 1% [24, 25, 12, 26], providing a direct measurement of y t . 
If not, this result will likely provide the most precise constraint on a combination of Higgs couplings 
directly sensitive to yt- 

2. Theoretical systematics for the ttH/ttZ production rate 

It is well known that one of the key obstacles to exploiting the immense statics available at hadron 
colliders for precision measurements, is the intrinsic difficulty in performing accurate absolute rate 
predictions. This difficulty arises from several sources. On one side we have the complexity, and often 
the large size, of higher-order contributions. At NLO one is often left with uncertainties in the range of 
10% (although these can be much larger, as in the case of Higgs or bb production, and more in general 
for processes dominated by gg initial states), uncertainties that can be reduced to the few-percent level, 
but not always, only with the inclusion of NNLO effects. On the other, there are uncertainties due to 
the knowledge of initial-state parton distribution functions (PDFs), which for gg processes range from 
several percent, to order-one factors in the case of very small or very large x values. Furthermore, 
the modeling of the realistic final states, including the description of hadronization and analysis cuts, 
which are required for the comparison with experimental data, require an additional layer of theoretical 
control, which very often cannot match the available precision of fully inclusive parton-level results. 
Finally, for specific processes, there are uncertainties due to the knowledge of input parameters (e.g. 
the value of the top or bottom mass, for processes involving these heavy quarks). 

Over the past few years, we have witnessed nevertheless a staggering progress in the theoretical 
precision, addressing all aspects listed above [27]. A benchmark example is the recent completion [28] of 
the NNNLO calculation for the inclusive Higgs production in the gg H channel, which, accompanied 
by the improved determination and consistency of the gluon PDF luminosities [29], has now reduced 
to about 3%nnnlo © 3%pdf the current uncertainty on the total production rate for this milestone 
process. A similar precision has been achieved [30] in the case of the tt production rate, at NNLO. In 
view of these examples, it is premature to establish today what the theoretical systematics will be at 
the time a 100 TeV pp collider will be operating. It is reasonable to anticipate that, also thanks to the 
opportunities offered by the precise LHC measurements for the validation of theoretical calculations 
and for the improvement of the PDFs, within the next 10, 20 or 30 years all Higgs production processes 
will be known with theoretical accuracy at the level of 1% or below. 

This notwithstanding, it is extremely useful to explore observables that can help improving 
even further the precision, by providing more robust confirmation of the systematics, and enabling 
measurements where the experimental systematics can be reduced to levels comparable to the 
theoretical ones. It is in this spirit that we propose, for the study of this paper, the ratio of the ttH to 
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ttZ cross sections, performed in fiducial regions of acceptance that make them suitable for a realistic 
experimental analysis. As we shall discuss here, the theoretical understanding of these processes, 
including NLO QCD [31, 32, 33] and EW [34, 35] corrections, and including the current knowledge of 
PDFs, allows already today to support an intrinsic overall theoretical accuracy at the percent level. 
This precision will certainly be consolidated, and further improved, by future developments. Today, 
this allows to start probing the experimental prospects of the 100 TeV collider, to put in perspective the 
role of precision Higgs measurements at a such a facility, and to provide useful performance benchmarks 
for the design of the future detectors. In this Section we shall motivate such accuracy claim. What will 
be learned, can also contribute to improve the expectations for future runs of the LHC, by improving 
the predictions for the relative size of the ttH signal and its irreducible tiZ background. 

2.1. Total rates and ratios 

The main observation motivating the interest in the study of the ttH/ttZ ratio is the close analogy 
between the two processes. At leading order (LO) they are both dominated by the gg initial state, with 
the H or Z bosons emitted off the top quark. The qq -initiated processes, which at the 100 (13) TeV 
amount for <) 10% 30%) of the total rates, only differ in the possibility to radiate the Z boson from 

the light-quark initial state. The difference induced by this effect, as we shall see, is not large, and is 
greatly reduced at 100 TeV. At NLO, renormalization, factorization and cancellation of collinear and 
soft singularities will be highly correlated between the two processes, since the relevant diagrams have 
the same structure, due to the identity of the tree-level diagrams. This justifies correlating, in the 
estimate of the renormalization and factorization scale uncertainties, the scale choices made for ttH 
and ttZ. The uncertainties due to the mass of the top quark are also obviously fully correlated between 
numerator and denominator. Furthermore, due to the closeness in mass of the Higgs and Z bosons 
and the ensuing similar size of the values of x probed by the two processes, and given that the choice 
of PDFs to be used in numerator and denominator in the scan over PDF sets must be synchronized, 
we expect a significant reduction in the PDF systematics for the ratio. Finally, the similar production 
kinematics (although not identical, as we shall show in the next Section), should guarantee a further 
reduction in the modeling of the final-state structure, like shower-induced higher-order corrections, 
underlying-event effects, hadronization, etc. 

The above qualitative arguments are fully supported by the actual calculations. All results are 
obtained using the MadGraph5_aMC@NLO code [36], which includes both NLO QCD and EW 
corrections. The default parameter set used in this study is: 


Parameter 

value 

Parameter 

value 

Gu 

1.1987498350461625 ■ 10~ 5 

ni f 

5 

m t 

173.3 

Vt 

173.3 

mw 

80.419 

m z 

91.188 

m H 

125.0 

a -1 

128.930 


MSTW2008 NLO [37] is the default PDF set and gp. = Of = go = S/efinai states m Tj/ 2 is the default 
for the central choice of renormalization and factorization scales, where tot./ is the transverse mass 
of the final particle /. This scale choice interpolates between the dynamical scales that were shown in 
Ref. [31] to minimize the pp dependence of the NLO/LO ratios for the top and Higgs spectra. 



a(ttH)[pb\ 

<j(ttZ) [pb] 

(7 (ttH) 
a(ttZ) 

13 TeV 

O A 7c;+ 5.79%+3.33% 
u,4UO -9.04%—3.08% 

n 70 c+9.81%+3.27% 

U.(00 n 2 % 3 i 2% 

n er\R+2A5%+0.525% 
U.OUO 3 66 % 0 319% 

100 TeV 

oo n+7.06%+2.17% 
-8.29% —2.18% 

r 7 o+8.93%+2.24% 
9.46% — 2.43% 

n coc + 1.29%+0.314% 
U.OOO 2 02%0 14 7% 


Table 1: Total cross sections a(tiH) and a(ttZ) and the ratios a(ttH)/cr(tiZ) with 

NLO QCD corrections at 13 TeV and 100 TeV. Results are presented together with the 
renormalization/factorization scale and PDF+a s uncertainties. 
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a(ttH)[ pb] 

a(ttZ) [pb] 

cr(ttH ) 
a (ttZ) 

13 TeV 

MSTW2008 

CT10 

NNPDF2.3 

r» a yr +5.79%+2.02% 
9.04%—2.50% 

O A Kr\~\~5.70yo-\-6.00% 
U.^OU _ 8 80% _5 34% 

fl A 7Q+5-26%+2.22% 
U '^' U -8.58%-2.22% 

r» i-7o r+9.81%+1.93% 
u -' 00 _ll .2%—2.39% 

Pi 1A 1 +9-50%+5.91% 
10.9%—5.29% 

0 771 +8.97%+2.16% 
U -' ' 1 —10.6%—2.16% 

Pi ^n^+2.45%+0.216% 
U - UUU _3. 66 %_ 0 .2 49 % 

0 £n7+^-34%+0.672% 
U.OU / _3 47%_o.675% 

Pi ftnQ+2-23%+0.205% 
U.UUJ _ 3 41% _ Q 2Q5% 

100 TeV 

MSTW2008 

CT10 

NNPDF2.3 

qq n+7.06%+0.94% 

-8.29% — 1.26% 
on 4 +6.87%+2.29% 
oz -^-8.11%-2.95% 
qq 9 +6.62%+0.78% 
°°' Z -6.47%-0.78% 

r*7 q+8.93%+0.90% 

01 ’ a -9.46%-1.20% 
cc r+8.73%+2.16% 

OO.O —g 27% —2.78% 

q+ 7-62%+0.75% 
OU.a 7 29%—0.75% 

n coc + l-29%+0.0526% 

U.OOO — 2 . 02 %_o .0758% 
n roq+1.27%+0.189% 
U.OOU—i gg %—q 260% 
n ro/| +1.29%+0. 0493% 
U.OOU -2 ,oi% 0 .0493% 


Table 2: Results with NLO QCD corrections at 13 TeV and 100 TeV, using three different sets of PDF. 
Results are presented together with the renormalization/factorization scale and PDF uncertainties. 
Contrary to Table 1, the as systematics is not included here. 


We start by discussing the results at the LO in the EW effects. The scale variation is performed 
over the standard range 0.5/xo < Pr,f < 2/io, with pn and pf varying independently. Both scale and 
PDF choices are correlated between numerator and denominator when taking the ratios. The resulting 
scale and MSTW 2008NLO PDF +a s uncertainties, for the total cross sections of the individual 
processes and of for the ratio, are shown in Table 1. Notice that the scale uncertainty of the individual 
processes, in the range of ±7 — 10%, is reduced to ±1.5% (±3%) for the ratios at 100 (13) TeV. The 
PDF variation is reduced by a factor close to 10, to the few permille level. 

To corroborate the great stability of the ratios, we also consider different PDF sets, showing in 
Table 2 the results obtained using the following LHAPDF 5.9.1 [38] sets: MSTW2008 NLO [37], CT10 
NLO [39] and NNPDF2.3 NLO [40] (in this case, we only consider the PDF variation, and not the 
a s systematics). While the overall envelope of the predictions for the individual rates includes a ±5% 
range, the ratio uncertainty due to the PDFs remains at the few permille level. 

We explore further variations in our default parameter set in Table 3. There, we remove the PDF 
uncertainties, which are practically unaffected by these parameter changes. Choosing the fixed value 
Po = m t + m.H,z /2 for the central choice of the renormalization and factorization scales, modifies the 
ratio cr(tiH)/a(ttZ ) by 1% — 1.5%, consistent with the range established using the dynamical scale. 

For mt, we consider a variation in the range of m* = 173.3 ± 0.8 GeV. We notice that a(ttH) is 
practically constant. This is due to the anti-correlation between the increase (decrease) in rate due to 




cr(ffiL) [pb] 

a(ttZ)[pb\ 

c t(UH) 
a(ttZ) 


default 

po =m t + m H ,z /2 

0 475 +5 - 79 ^ 

0 529 +5 ' 96% 

U.OZy_g 42 % 

0.785±? i 8 |’| 

0-885t?i 9 ^ 

0-606+3[66% 
0-597 tlUl 

13 TeV 

nit = DtV = 174.1 GeV 

0 474 +5 74% 

0 77q+9-76% 
U./ (o_ 112% 

0-614+;^: 


m t = y t v = 172.5 GeV 

0 47^+ 5 - 81 ^ 
U - 4,S - 9 .05% 

0.795±® i *l|| 

0-597+;^ 


m H = 126.0 GeV 

0 464 +5 ' 80% 

u.4ou_g 04% 

0 785 +9 ' 81 ^ 

0.593+;^: 


default 

po = m t + m H ,z /2 

qq q+7.06% 
00 *_ 8.29% 

QQ n+9.76% 
° y ‘ U -9.57% 

E 7 0+8-93% 
0I ^-9A6% 
cn o+10.9% 
°'- Z - 10 . 6 % 

0.585+;^ 

0 580 +1 161 ^ 
U.OoU —i go% 

100 TeV 

m t = yt.v = 174.1 GeV 
m t = y t v = 172.5 GeV 

m H = 126.0 GeV 

33 n+ 7 - 01 % 
oa.a _ 8 27 % 

qq 7 + 6 . 99 % 

°°-*-8.31% 
qq <)+7.04% 
°°' Z -8.28% 

9+8.90% 

- Z -9.42% 

58 fi+ 8 - 93% 

00 '°-9.46% 

C 7 qH- 8.93% 
°‘- y -9.46% 

0 592 +1 ' 27% 
u.oaZ -2 00 % 

0 576 +1 27 ^ 

U.O I O — i 9 9 % 
n c 7 c + 1.25% 

O. 0(0 i 95% 


Table 3: Results with NLO QCD corrections at 13 TeV and 100 TeV by varying some parameter 
values. Results are presented together with the renormalization/factorization scale uncertainties. 
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a(ttH)[ pb] a(ttZ)[ pb] a \ uZ ) 

13 TeV 

m t = 174.1 GeV 

mt = 172.5 GeV 

0.3640 0.5307 0.6860 

0.3707 0.5454 0.6800 

100 TeV 

m t = 174.1 GeV 

m t = 172.5 GeV 

23.88 37.99 0.629 

24.21 38.73 0.625 


Table 4: LO results at at TeV and 100 TeV, keeping the top Yukawa coupling y t v = 173.3 GeV. 


pure phase-space, and the decrease (increase) in the strength of yt, when the top mass is lower (higher). 
The ttZ process is vice versa directly sensitive to mt at the level of ±1.5% over the ±0.8 GeV range, 
and this sensitivity is reflected in the variation of the cross-section ratio. We notice, however, that if 
we kept the value of yt fixed when we change mt, the dynamical effect on the rate would be totally 
correlated, and the ratio would remain constant to within a few permille, as shown in Table 4. This 
shows that the ratio is only sensitive to the strength of yt, and only minimally to the precise value of 
m t . 

Finally, we observe a ~ 2% shift in <j{ttH) (and therefore in the ratios) when mu is changed by 
1 GeV, which is a gross underestimate of the precision with which the Higgs mass is [43] and will soon 
be known. 

The effect of the NLO EW corrections in the a(mz) scheme is shown in Table 5. The shift 
in the ratio with respect to the pure NLO QCD result is of the order of 2%. For reference, we 
also provide the results in the scheme. In this scheme, we use a -1 = 132.50699632834286 and 
Gfj. = 1.166390 • 10 -5 . The overall difference from the a(mz ) scheme for the individual rates is at the 
percent level, and at the permille level for the ratios. We conclude that, once the known NLO EW 
effects are incorporated, the residual uncertainty of the cross-section ratio due to higher-order EW 
corrections should be significantly below the percent level. 

Before closing this discussion of the total rates, we remark on the relation between the predictions 
for the cross section ratios at LO and at NLO. Since at LO the renormalization and factorization scales 
only appear in the PDFs and in a s (yu), and given that the numerical values of the scales is very similar 
(as a result of mn — mz <C 2 mt ± mn,z), the LO ratios come with an unreliably optimistic estimate 
of the scale and PDF uncertainty. It is only at NLO that, through the introduction of the appropriate 




a(mz) scheme 

a(ttH)[ pb] a(ttZ)\ pb] 

a(ttH) 

cr(ttZ) 

G, 

a(ttH)[ pb] a 

scheme 

(tiZ)[ pb] 

a(ttH) 

o{ttZ) 


NLO QCD 

0.475 

0.785 

0.606 

0.462 

0.763 

0.606 


e>(a|a 2 ) Weak 

-0.006773 

-0.02516 


0.004587 - 

0.007904 


13 TeV 

e>(a|a 2 ) EW 

-0.0045 

-0.022 


0.0071 

-0.0033 



NLO QCD±Weak 

0.468 

0.760 

0.617 

0.467 

0.755 

0.619 


NLO QCD±EW 

0.471 

0.763 

0.617 

0.469 

0.760 

0.618 


NLO QCD 

33.9 

57.9 

0.585 

32.9 

56.3 

0.585 


0(a |a 2 ) Weak 

-0.7295 

-2.146 


0.0269 

-0.8973 


100 TeV 

0(a |a 2 ) EW 

-0.65 

-2.0 


0.14 

-0.77 



NLO QCD±Weak 

33.1 

55.8 

0.594 

32.9 

55.4 

0.594 


NLO QCD±EW 

33.2 

55.9 

0.594 

33.1 

55.6 

0.595 


Table 5: Effect of the EW NLO corrections, in the a(mz) and G^ schemes, at 13 TeV and 100 TeV. 
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: 
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Figure 2: Scale and PDF systematics of ratios of integrated px spectra for different observables, at 
100 TeV. From left to right: px of the boson, px of the top quark, px of the tt pair. 


kinematical factors Q 2 in the renormalization logarithms log(^|j/Q 2 ), the relevant differences between 
the scale behavior of two processes are first exposed. At NLO one also encounters classes of (IR- and 
UV-finite) diagrams that differ between the two processes, and contribute to finite NLO terms that 
cannot be estimated using scale-variation arguments. For example, light-quark loops can couple to the 
Z boson, but not to the Higgs. 

As always when using scale-variation tests to assess the theoretical systematics, there is therefore 
no guarantee that yet higher-order corrections will not exceed the range predicted by those estimates. 
For a measurement as important as the extraction of the top Yukawa coupling, it is reasonable to 
demand that the uncertainty estimates we provided here be confirmed by a full NNLO calculation, 
something that will certainly be possible over the next few years. Nevertheless we believe that the 
studies presented here provide a rather compelling case to argue that a precision at the percent level 
is reasonable. 

2.2. Kinematical distributions 

Any experimental analysis, and in particular the boosted approach that we employ in this work, 
will restrict the phase-space available to the final states. To preserve the precision in the theoretical 
prediction of the ratio of total tiH and ttZ cross sections, it is crucial to ensure that the reduction in 
systematics uncertainties carries over to the description of final states after kinematical cuts have been 
applied. We present here a summary of our studies at 100 TeV, focused on the kinematical distributions 
most relevant for our studies, and limited to main sources of uncertainty (scale and PDF). The results 
for other distributions and for other systematics (top mass, EW scheme), at 100 and at 13 TeV, lead 
to similar results, and are available upon request. 

We show in Fig. 2 the ratio of the integrated px spectra of various final-state objects X: 
a[ttH](px,x > Pt, min) /cr[ttZ](px,x > Pt. min)- On the left, X = H(Z) for the ttH (ttZ ) process. 
In the middle, X = t and on the right X is the tt system. We normalize the ratios to 1 at px. min = 0, 
so that the resulting uncertainties correspond to the systematics in the extrapolation of the ratio of 
differential distributions to the ratio of the total rates. The three upper panels show that the ratios 
are not a constant, and can change buy up to 20% up to px = 500 GeV. The relative uncertainties, 
separately for the scale and PDF variation (MSTW2008 NLO set), are shown in the lower plots. The 
scale uncertainties reach a value of ±2% for the boson px spectra, ±1% for the top, and ±3% for the 
px of the tt pair. The PDF uncertainties remain well below the percent level throughout. 

These results imply that the relative shapes of the px spectra can be controlled with a precision 
that remains consistent with the overall goal of a percent-level extraction of the relative rates. There is 
no doubt that future NNLO calculations of both processes will improve this even further. Very precise 
measurements of the shape of the Z boson spectra in ttZ events using e.g. the very clean leptonic Z 
decay will also help confirming the accuracy of the predicted px spectra and reduce a possible left-over 
uncertainty. 
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Just like at the LHC, the ttH production process can be studied for a variety of Higgs decay channels. 
We collect in Table 6 the event rates for potentially interesting Higgs decays combined with ttH 
production, for an integrated luminosity of 20 ab 1 at 100 TeV. These numbers include the branching 
ratio for the mixed lepton-hadron tt —> £u(+ jets decay (i = e, p), in addition to the relevant Higgs 
branching ratios. 

Considering that analysis cuts and efficiencies will typically reduce these rates by a further factor 
of 10 or more, it is clear that the otherwise very clean H —► \t does not have the minimum number 
of 10 4 events, required to aim for a 1% target precision. In the case of H —> 77 (see also [44]), we 
considered a simple parton-level analysis, implementing basic cuts such as: 

pT.j.b^j ^ 25 GeV , ^ 2.5 , ^Rjj^b,bj ^ 0*4 

Pr,t > 20 GeV , \ru\< 2.5 (1) 

These leave around 5 • 10 4 events with 20 ab - , while the ttyy background, subject to a |m 77 — 1251 < 
5 GeV cut, is almost a factor of 10 smaller. On the other hand, detection efficiencies, such as those 
related to lepton or photon isolation and to b tagging, make this channel borderline for a 1 % statistical 
accuracy, and call for a dedicated study including realistic projections of detector performance. The 
H —>• 2£2n final state has a potentially interesting rate, which may deserve a separate study. 

Given the extraordinary rate for the H -A bb final state, and following the original LHC analysis [ 6 ], 
we focus on this channel, 

pp -» ttH -> ( bjj) (Mu) (bb), (Mu) (bjj) (bb) . (2) 

The leptonic top decay guarantees the triggering and reduces multi-jet combinatorics. The leading 
backgrounds are: 

pp —> tt bb, the main irreducible QCD background 
pp —> tiZ, including the Z- peak in the distribution 

pp —> tf+jets with fake-bottoms tags 

Additional backgrounds like W+jets will be small and do not lead to dangerous kinematical features for 
our analysis [ 6 ]. The analysis strategy based on boosted top and Higgs decays is extremely simple [ 6 ], 

(i) an isolated lepton 

(ii) a tagged top without any 6 -tag requirement 

(iii) a tagged Higgs with two 6 -tags inside 

(iv) a continuum 6 -tag outside the top and Higgs fat jets 

The mbb distribution will provide us with simple sidebands to control the tibb and ft+jets backgrounds, 
and a second mass peak from the tiZ mass peak. We discuss the unfortunate need for the continuum 
6 -tag below. The simplicity of our analysis will allow us to efficiently control systematics. 

For simplicity, all Monte Carlo event samples are generated at leading order. The main effects 
from the available higher order predictions of the ttH signal [31, 32], the tiZ background [33], the 
tibb background [42], and the tf+jets background [45] are discussed separately in Section 2, so for the 
signal-background analysis we leave them out. We use MadGraph5 [46] with NNPDF23 parton 
densities [47], showering and hadronization via Pythia8 [48] and the fast detector simulation with 
Delphes3 [49, 50]. 


H -a M 

H —l► 77 

H 212u 

H -+bb 

2 . 6 - 10 4 

4.6- 10 5 

2.0 • 10 6 

1 . 2 - 10 8 


Table 6 : ttH event rates for various Higgs decay modes, with 20 ab 1 at 100 TeV, assuming 
ti —> fp+jets. Here and for Higgs decays, £ can be either an electron or a muon. 
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Figure 3: Reconstructed for the leading-J substructures in the fat Higgs jet. We require two 
6-tags inside the fat Higgs jet (left) and an additional continuum 6-tag (right). The event numbers are 
scaled to C = 20 ab -1 . 


At the generator level we require pr,j,b,e > 10 GeV and A Rjj t bb,ji > 0.1. The tt+jets background 
is generated as one hard jet with prj > 100 GeV at the hard matrix element level. We do not 
consider merged samples since we found that the influence of tt + 2j to our analysis is negligible. After 
generator cuts we start with a signal cross section of 4.2 pb. Associated ttZ production yields 1.2 pb. 
The continuum ttbb background counts 121 pb and is at this stage dominated by tt+jets with 2750 pb. 

Delphes3 provides isolated leptons as well as parton-level 6 -quarks needed for the tagging 
procedure later-on. Leptons have to pass a minimum pt/ > 10 GeV. For their isolation we demand 
a transverse momentum ratio (isolation variable) of I < 0.1 within A R < 0.3. Finally, we use the 
energy flow objects for hadrons to cluster via the Cambridge/Aachen (C/A) jet algorithm [51]. The 
jet clustering and the analysis are done with FastJet3 [52], a modified BDRS Higgs tagger [23, 6] 
and the HEPTopTagger2 [22]. For all 6 -tags we require a parton-level 6 -quark within A R < 0.3. 

First, we require one isolated lepton with \yg\ < 2.5 and px,c > 15 GeV. For the top tag [53, 54, 55], 
we cluster the event into fat C/A jets with R = 1.8 and prj > 200 GeV. Provided we find at least 
two fat jets we apply the HEPTopTagger2 with the kinematic requirement \y'p < 4. The recent 
significant update of the HEPTopTagger2 relies on two additional pieces of information to achieve 
a significant improvement [22]. One of them is A-subjettiness [56], which adds some sensitivity to the 
color structure of the event. The other is the optimalR mode, which based on a constant fat jet mass 
reduces the size of the fat jet [57] to the point where the fat jet stops containing all hard top decay 
subjets. This minimal size can also be computed based on the transverse momentum of the fat jet. 
Since the signal and all considered backgrounds include a hadronic top quark, changing the top tagging 
parameters results only in an overall scaling factor. In this analysis we do not cut on the difference 
between the expected and the found optimal radius because the initial fat jet size is already chosen to 
fit the expected transverse momenta. To have a handle on the QCD multi-jet background, we place a 
mild cut on the filtered A-subjettiness ratio T3/T2 < 0.8 which can be tightened at the cost of signal 
efficiency if desired. After identifying the boosted top we remove the associated hadronic activity and 
apply a modified BDRS Higgs tagger to fat C/A jet(s) with R = 1.2, [gf*^ \ < 2.5, and pr.j > 200 GeV. 
Our decomposition of the fat jet into hard substructure includes a cutoff of m su b > 40 GeV for the 
relevant substructure and a mass drop threshold of 0.9. The hard substructures are then paired in all 
possible ways and ordered by their modified Jade distance, 

J = PT,iPr I 2(Ai?i2) 4 . (3) 

The leading pairing we filter [23] including the three hardest substructures, to allow for hard gluon 
radiation. For consistency we require a reconstructed transverse momentum above 200 GeV. Within 
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mbb G [100,150] GeV 

2 6-tags 

3 b-tags 

ratio 

ttH 

2.4E+5 

6.4E+4 

1/3.8 

ttbb 

1.2E+6 

2.4E+5 

1/5.0 

tt+ jets 

1.9E+6 

3.8E+4 

1/50 

ttz 

2.3E+4 

4.9E+3 

1/4.7 


Table 7: Event rates assuming an integrated luminosity of 20 ab 1 


this Higgs candidate we ask for two b- tags, assuming a global tagging efficiency of 50% and a mis- 
tagging probability of 1% for all jets within y :) | < 2.5 and prj > 30 GeV. As we can see in the 
left panel of Fig. 3, the tt+jets and ttbb backgrounds are of similar size at this stage. Moreover, the 
analysis sculpts the backgrounds towards mbb ~ 100 GeV. 

To simplify the background composition and to avoid the strong background sculpting it turns out 
that a third, continuum 6-tag is useful. We target the decay jet of the otherwise leptonically decaying 
top by removing the top and Higgs constituents from the event and then clustering the remaining 
hadronic structure into C/A jets with R = 0.6 and pr,j > 30 GeV. For one of them we require a 6-tag 
within \yb\ < 2.5 and a angular separation A Rbj > 0.4 from all other jets, now including the top and 
Higgs decay products. 

The effect of this third 6-tag becomes clear in the right panel of Fig. 3. We are now dominated by 
the continuum ttbb background. The corresponding event rates for an integrated luminosity of 20 ab -1 
are given in Tab. 7. While the light-flavor tt+jets background is now suppressed well below the leading 
ttbb background it is still of the same size as the Higgs signal, which means we still need to include it 
in our analysis. 

4. Updated BDRS tagger 

The two improvements of the HepTopTagger2 can also be added to the BDRS Higgs tagger [23, 
52, 58]. The decay H —> 66 will typically contain two hard substructures, so using V-subjettiness the 
characteristic parameter t 2 /ti has to be small. The correlations between the reconstructed masses 
and the ratio t 2 /ti of the filtered fat jets in Fig. 4 indicate that a cut t 2 /ti < 0.4 not only reduces 
the backgrounds but additionally leads to narrower and better-defined mass peaks for the Higgs and 
^-decays as shown in Fig. 5. 

In the optimalR version of the BDRS tagger we reduce the size of the Higgs fat jet candidate. Aside 
from reduced underlying event and pile-up this minimizes the combinatorics in the mbb reconstructions. 
As for the top case [22] we shrink the fat jet radius in steps of 0.1 as long as the jet mass does not 
drop below mj < 0.8mj iOr i g relative to the originally tagged Higgs jet with R = 1.2. We can extract 



m rec [GeV] m rec [GeV] m rec CGeV] m rec [GeV] 


Figure 4: Correlation between the reconstructed mass m Tec and the V-subjettiness ratio t 2 /ti of the 
filtered Higgs candidate fat jet for the signal and background samples. The event numbers are scaled 
to C = 20 ab -1 . 
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Figure 5: Reconstructed nibb of the Higgs and Z candidates in ttH and ttZ production with the default 
BDRS tagger (left) and after using optimalR and the V-subjettiness cut 72 /ti < 0.4 (right). In the 
right panel we include the fitted Crystal Ball functions. The event numbers are scaled to C = 20 ab _1 . 


the expected value of A Rbb from a fit to Monte Carlo simulations, 


A R 


(calc) 

bb 


250 GeV 
Pt, sit 


(4) 


This supports the choice of R = 1.2 for the C/A jet clustering for the Higgs Tagger requiring transverse 
momenta of pr > 200 GeV. Unfortunately, for ttH production the relation between the expected and 
the measured values of A Rbb does not significantly improve the analysis. However, the mass difference 
between the Higgs and the Z boson leads to a shifted peak in the A Rbb — Ai?^ alc ) distribution for 
ttZ. This shift allows for an additional reduction of ttZ if desired. In the final result shown in the 
left panel of Fig. 6 we include a triple 6-tag, the V-subjettiness variable 72 /ti , and a modified fat 
jet radius for the Higgs candidate. Since the background region m^b £ [160, 300] GeV is smooth and 
untouched by any signal, we can use it to subtract the QCD continuum from the combined ttH and 
tiZ signal. If the soft regime mbb £ [0, 60] GeV can be useful in the same way needs to be checked by 
a full experimental analysis. 


For the signal region mbb £ [104,136] GeV we arrive at a signal-to-background ratio around 
S/B ss 1/3 and a Gaussian significance S/VB = 120, assuming an integrated luminosity of 
C = 20 ab -1 . The error on the number of nominally Ns = 44700 signal events is given by two 
terms. First, we assume that we can determine N$ from the total number of events Ns + Nb using 
a perfect determination of Nb from the side bands. Second, the side band mbb £ [160, 296] GeV 
with altogether N s j^e = 135000 events and a relative uncertainty of 1/v^Vside introduces a statistical 
uncertainty A N B , altogether leading to 


A N s 


(y /-Vs + Nb^J + (A Nb) 

(■\/Ns + N B y + 


-11/2 




1/2 


= 0.013 A/s • 


(5) 


For the Yukawa coupling this translates into a relative error of around 1%. The first term alone would 
give A Vs = 0.010 Vs- 


In the right panel of Fig. 6 we show a combined fit to the Z and Higgs peaks assuming a 
perfect background subtraction. A combined analysis of both peaks (with known masses) serves as a 
check of the jet substructure techniques [23, 6] and as a means to reduce systematic and theoretical 
uncertainties, as discussed in Section 2. Given separate simulations for the Higgs and Z peaks, we 
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Figure 6 : Left: Reconstructed nibb for the leading- J substructures in the fat Higgs jet. We require two 
6 -tags inside the fat Higgs jet and a continuum 6 -tag. Unlike in Fig. 3 we apply an JV-subjettiness cut 
and use an optimalR version of the BDRS tagger. Right: Double-peak fit assuming perfect continuum 
background subtraction. The event numbers are scaled to C = 20 ab -1 . 


can fix the shape of both distributions by fitting a Crystal Ball function [59] to each of them as 
done in the right panel of Fig. 5. For these fits we limit the exponent of the non-Gaussian tails to 
50. In addition we fix the peak positions accounting for a shift due to losses in the reconstruction. 
Their linear combination allows us to model the background subtracted mass distribution. In the 
combined fit we keep all shape parameters fixed and allow only for separate scaling factors of each 
peak. From the double Crystal Ball function we finally receive the relative size of the two peak areas 
Nh/Nz = 2.80 ± 0.03. Using the combined fit therefore allows us to probe the top Yukawa coupling 
with a statistical precision of ~ 0.5%. Given the discussion of Section 2, this precision can be eventually 
matched by the theoretical systematics, assuming no new physics affects ttZ production beyond the 
percent level. It remains to be explored to which extent the future detectors can benefit from the 
potential cancellations of experimental systematics in the measurement of the Nh /Nz ratio. 

5. Outlook 

The top Yukawa coupling is one of two key parameter required for the understanding of the Higgs 
potential, and it is a crucial ingredient to the measurement of the Higgs self-coupling. At the LHC 
its determination will be limited to around A yt/yt ~ 10 % because of statistical as well as theoretical 
uncertainties [9, 5]. At a 100 TeV hadron collider the increased statistics will significantly improve 
this measurement. 

We proposed here to measure the top Yukawa coupling using the decay H —> 66 in the boosted 
phase space regime. Our simple analysis strategy [ 6 ] relies on a trigger lepton and two fat jets, one from 
the hadronic Higgs decay and one from the hadronic top decay. The mbb distribution will show a clear 
peak from the Higgs signal as well as a similarly large peak from the Z background. The continuum 
side band and the second peak offer two ways to control the backgrounds as well as the translation 
of the ttbb rate into a measurement of the Yukawa coupling. We find that a measurement of the 
top Yukawa coupling to around 1% should be feasible at 100 TeV collider energy with an integrated 
luminosity of 20 ab -1 . This is an order of magnitude improvement over the expected LHC reach, with 
significantly improved control over the critical uncertainties. 

There exist additional, complementary handles on the uncertainties. For example, the H —> 77 
decay could allow a direct measurement of the ratio of branching ratios B(H —► 77 )/B(H —► 66 ). 
It would serve as be complementary, although indirect, probe of the ttH coupling. Furthermore, 
H —► 212v could also be interesting, since there is enough rate to explore the regime pt,h tuh, 
which, particularly for the e ± pTvv final state, could prove particularly clean. 
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